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ABSTRACT 



r 



Correlation, regression, and score interval analyses were conducted for six academic 
measures as predictors, of essay writing and overall perf ormance-.J' Comparisons for all 
analyses were made for men, women, Asians, blacks, Hispanics, and whites. The 'cor- 
relational comparisons showed few differences across gr'oups, except that correlations 
tended to be lower for the white sample because of variance'restr ictions. The re- 
gression comparisons agreed with previous studies showing blacks and Hispanics to bfe 
generally overpredicted . On essay-writing performance, meh were also overpredicted 
by conventional basic skills measures. In contrast, women tended to write better 
essays than would have beeh predicted by conventional basic skills measures. False 
negatives, thosfe people who score JLow on a prfedictor but who excel on a criterion, 
occurred least in the' black and Hispanic groups. ^ 
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INTRODUCTION .J ^ . 

K • ' » 

Because of contemporary concerns about fairness t;o sex and ethnic groups, it has 
become customary to conduct analyses of academic tests in 'use or proposed for use to 
determine if any unfairness may result from such use. A previous study (Breland, 
1977b) reported a number gf such analyses for the test of Standard Writtea English * 
(TSWE) , but this report was limited by the available data. Because of the rela- 
tively small number of minority students in the* sample, it was necessary to combine 
all minorities into one group for analysis. The present report does not have that 
limitation. 

The most common approach to the examination of fairness is to compare correla- 
tional and regression ana-Lyses for* ^t,he various gVoups' of interest. However, these 
kinds of analyses do not reveal cer.tain kinds of potential unfairness. For example, 
they do not examine specific groups* of pejople who may score low on a predictor but 
who perform satisfactorily if allowed to enter an educational program. Such people 
have been termed "false negatives." Conversely, there exist groups of students wh'o 
score high on a predictor but who do not perform satisfactorily later. These have 
been called "false positives." When false positives and false negatives are summed, 
they represent "misses" — people for whom an erroneous prediction was made. Cor- 
relational and regression analyses may not reveal the actual consequences repre- 
sented in false nega^tives and false j^osit^ives. For that reason it is important to 
examine the bivariate distributions of predictor scored and actual performance 
measures to .understand better the potential for unfairness in the use of tests. 

A second issue m fairness investigations is. whether the outcome being predicted 
is reliable and unbiased. This concern is commonly referred to as the "criterion 
problem." The criterion usually used in prediction studies of fairness is the grade 
point average (CPA). Another useful criterion is performance on some task quite 
different from the predictor. For example, essays written in English composition 
courses can be used as a criterion- for comparing performances U>r different groups 
on multiple-choice tests intended for the prediction of college performance. This 
kind of criterion has the advantage that it is usually more objective than CPA or 
specific college course grades - especially w.hen essay scores are assigned by 
readers remote from the instructional setting. 

Previous studies of some of the same basic skills measures investigated in the 
present study have been reported by a number' of writers. Breland (1977a) reported 
an investigation of the Test of Standard Written English (TSl^) and^its use in 
college English placement for four institutions. The institutions provided d'ata ^ 
on student performance during college, and Educational Testing Service -matched these 
data with other data obtained at the time students applied for college. Student 
performances on essay tests of writing ability were shouna to have a strong relation- 
ship to student performance on multiple-choice tests of wilting ability as repre- ^ 
sented by the TSWE. Second, the analyses\i|ndicated that a brief multiple-choice 
test of writing ability predicted actual yfiting performance during the freshman 
year of college as well as or better than a brief essay test given at t^e beginning 
of the freshman year. J - ^ 

V - - 

As noted above, Breland (1977b) compared men, women, minorities, and non- 
minorities with respect to performance on the TSWE and subsequent performance in 
English composition courses as well as performance on brief impromptu essays. No 
important group differences in traditional correlational analyses for either grade 
or* essay pr-ediction were observed. Analyses of correct and incorrect placement de- 
cisions (hits and misses), at specific TSWE cutoff scores revealed no noteworthy group 
differences whether outcomes were based on English course grades or on freshman 
writing performance. From all groups* the TSWE appeared to predict freshman-year 
writing performance as well' as or better than precourse writing samples, high school 
English grades, or high school rank in class. Because of the limited number^of cas^ 
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available for analysis, however, it was not possible to conduct analyses wittiin 
minority subgroups. 

Bianchini (1977) presented solne basic data for the August 1977 administration of 
the English Placement Test .(EPT) . These basic data included means, standard devia- 
tions, KR-20 reliabilities, inter-reader correlations for the essay, and speed and 
power analyses. Rankin (19^8) visited the 19 campuses of the California State 
Universities and Colleges (CSUC) using the EPT and reported on its use. This study 
was conducted to determine how the EPT was beiHig used at CSUC campuses. Several 
recommendations were made for improvements in instruction and placement. Dunbar,/ 
Minnick, and Oleson (1978) collected data for the Hfiyward campus for the purpose of 
examining the diagnostic capabilities of the EPT, especially for students with 
minimum writing proficiency. They conducted correlational analyses, multiple je- 
g^ession analyses, and discriminant analyses. These analyses indicated that the EPT 
essay score contributed significantly to the prediction of course performance but 
that the contribution was not large. Some questions were raised concerning the 
coat-effectiveness* of the EPT as a result of high observed dortela t ions between 
EPT and other measures available. ^ 

Michael and Shaffer (1978) collected data at the North^idge campus of CSUC and 
conducted extensive analyses. The purpose of these analyses was to examine compara- 
tive validities of the* EPT ^nd other measures available for placement as well as 
comparative validities for different groups. As did Dunbar et al., Michael and 
Shaffer suggested that there were substantial intercocrelat^Lons among tests studied. 
Correlations were also reported between test scores anti fall-semester CPA and grades 
in a freshman English course. The»EPT-Total seore correlated .30 with (?PA and .47- 
with English grades. These correlations were slightly higher, as would be expected, 
than those for the shorter SAT-y test: .27 with GPA and .41 with English grades. 
One would expect greater reliability, and consequently higher correlations, using 
the longer test. Mixed results were obtained from analyses of small groups cf blacky 
and Hispanic students, as some results suggested good predictability of coutse per- 
formance and others did not. 

In another study Bailey (1978) conducted a factor analysis in which SAT sub- 
scores, the TSWE, and EPT subscores were entered. The primary objective in this 
analysis was to determine if both sets of tests (the EPT and* the SAT-TSWE set) 
would load on a common factor or whether unique properties might be posses'sed by 
either. A strong common factor was identified Accounting. for 80 percent of the 
variance from all these tests. It was suggested that all the tests have many 
common jiroperties and that the EPT and the SAT* ( including TSWE) may be measuring ^ 
similar abilities. 



Instruments • 

This report pxesents comparative analyses of sev^n academic tests: the Scholast^ij: 
Aptitude Test (with its verbal and mathematical sections considered separately), ^ 
the Test of Standard Written English (TSWE), and fQur subtests of the English 
Placement *Test (EPT). The four subtests are EPT-Reading, EPT-Senten<;fe Construction, 
EPT-togic and Organization, and EPT-Essay. Each^ of the tests is described briefly 
in the following paragraphs. 

SAT-Verbal. This section of the SAT is made up of two verbal sections, each one ^ 
/ requiring 30 minutes. The questions measure the ability to understand what has 

been read, as well as the extent of vocabulary development. SAT-Verbal scores are 

reported ^n a scale with a range of 200-800. The national mean for 1980 college- / * 

bound seniors was 424, with a standard deviation of 110. 

SAT-Mathematical . This has two mathematical sections, each one requiring 30 minutes. 
The questions measure problem-solving abilities closely related to college work. 




SAT-Mathematical scores are also reported on a 200-800 scale. The national mean for 
1980 coHege-bound serviors was 466, with a standard deviation of 117. 

TSWE. The Test of Standard Written English is a 30-ininute^ multiple-choice test 
administered with th^ SAT. The questions ev^luate^the ability to recognize standard 
written English, the 'language of most college textbooks, and the English that 
college students are usually exp^ted to use. in papers they write for courses. 
TSWE scores are reported on a scale of 20 to 60+, with 60+ representing all scores 
of 60 and .above. The national m^an for 1980 college-bound seniors was ^2.«^», with a 
standard^ deviation of 11.0.' 

EPT-Reading . A 50-item, 40-minute^ test designed*^ to measure , several reading skills ^ 
such as identifying the main idea, interpreting directly or indireotly stated ideas, 
inferring the meaning of words within the context of a reading passage, and recog- 
nizing, levels of meaning througl) figurative phrase3. Scores have a range from 120 
to 180, with a mean df 150 and a standard deviation of 10.' 

EPT-Sentence Construction , This is a 50-item, 40-minute test designed to measure 
how well students can, recognise arrrangements of sentence elements that express the 
meaning clearly and correctly. The test has the same scale as EPT-Reading. 

EPT-Logic and Organization . This is a 50-item, 40-minute test designed to measure 
the ability to see relationships between words, sentences, objects, and ideas. The 
test has the same scale as EPT-Reading and EPT-Sentence Construction. 

EPT-Essay . A 45-minute essay is written in response to a s^lecial topic presented. . 
The essay is scored on a six-point scale by two readers, independently, and. these'* 
two scores are added to yield *a score in the range from 2 to 12. In cases of ex- 
treme disagreement between readers, a third reader is used to resolve th*e difference. 



Procedures 

This report presants comparative analyses of seven academic tests for six different 
groups: men, women, Asians, blacks, Hispanics, and whites. A principal objective 
in the analyses was to avoid the use of GPA or of individual course grades as 
critertia, since these are subjective measures and are often biased. Because of the 
•objective nature of writing assessment throug\i multiple judgments by readers of 
essay samples, the EPT-Essay score based on a 45-minute writing sample was viewe<i. ^ 
as a good standard for comparing other basic skills measures available. Addition- 
ally, the EPT-Total score (which includes the essay) was considered a, good standard 
for some analyses because of the large amount of information represented in it. 

Correlational analyses were conducted to show basic relationships at^ong measures 
for the total sample available and for subgroups as well. Regression analyses were 
conducted to explore further the nature of^the relationships represented i^f snimnary 
form in the correlations. The regression analyses were essentially intended- to 
determine whether yithin-group regression lines differed signi f icantl^f for different 
groups. 

Because correlational/ regression" analy&es ''as commonly conducted make assumptions 
of linearity in relationships, analyses were also made at different ^core 'levels as 
a check on the other analyses* The a^ialyses within score lev€%s, addressed the 
quesj:ion; What is the expectation of performance on a criterion measur^|iven 
performance within a certain rapge on the predictor measure? ^, ^ 



1. Recent versions of EPT tests require slightly less'te^ting time. 

2. The analyses are based on data collected by €he (^al if or^a^ State 
InJ^versities and Colleges (CSUC) for students entering in 1977 and matched 
with College Board data provided by Educational Testing Service. 



Cprrelaplonal Analyses^ 

Xables 1 through 4 show correlations-^ between the EPT components - the EPT-Essay, the 
'EPT-Reading score, the EPT-Sentence Construction score, the EPT-Logic ai>d Organizatioti 
score — and the other available scores. Table 1 would suggest that the EPT-Esfeay 
score is about equally well predicted by the other EPT components, the TSWE, and che 
SAT-V. Only the SAT-M correlation^ with the 'EPT-Essay score are noticeably lower, as' 
would f>Q expected. This trend is consistent for all groups, except whites. Inter- 
estingly, the white sample correlations are lowest of all. The lower white corjjp- 
lations are at, least in part attributable *to the lower white standard deviation (1.71) 
on the EPT-Essay score. This is somewhat lower than the standard deviations for 
Asians (1.93), blacks (1.86) , and Hispanics (1.94) on the EPT-Essay. Similar differ- ^ 
ences* in variances occurred for most of tTie other measures studied, and these would 
further reduce correlations attainable for whites.^ 

t 

All the correlations are attenuated somewhat^, however, by the generally low re- 
liability of writing sample assessment as represented in the EPT-Essay. As noted in 
Table 1, the reliability of a single essay scored independently by two readers was 
estimated at .38 by Coffman (1966). Breland and Gaynor" (1979) used a longitudinal 
design to obtain a/^st^etest reliability estimate of .52 for a similar ^^direct 
(writing sample) assessment. But Werts, Breland, Grandy, and Rock (1980>' conducted 
analyses suggesting that the test-retest estimate of .52 may be an overestimate 
because of the potential of correlated errors of measurement resulting from the 
writing sample securing. That is, readers may be systematically influenced by 
extraneous factors (possibly handwriting quality, neatness, and similar influences). 
, These analyses suggest that the reliability of essay assessment with one sample and 
cwo readers is probably in ^he range from about .35 to about .45. 

Table t shows that the correlational relationships increase somewhat when the 
multiple-choice EPT-Reading score is .related to College Board scores (TSWE, SAT-V, 
SAT-M). SAT-V correlated best with EPT-Reading for all groups. As before, the 
lowest SAT-V correlations were obtained for the white sample. 

In Table 3, the EPT-Sentence Construction score i/ correlated with the College 
Board scores. As with tjhe EPT-Reading score in Table 2, these co^elations are 
generally higher than with the EPT-Essay score. The difference between the EPT- 
Reading and EPT-Sentence Constroiction scores is that, while the former tended to 
colrrelate best with SAT.-V, the latter tended to correlate best with TSWE. That is, 
of course, what one would expect. ^ 

Table 4* shows the correlat ionOvL relationships bet^en the EPT-Logic and Organi- 
zation test and the College Board scores. Here the SA'^V appears to be the best 
correlate, but the white sample, as before, yielded the lowest corr^elations. 
* 

Table 5 relates the composite of the EPT components (the EPT-Tocal score) and 
the College Board scores. ' These correlations exhibit the same pattern as obtained 
for the EPT components buc are generally higher because of the greater reliability 
of the EPT-Total score. 

In Table 6, EPT and College Board scores 4re first separatelj^ combined to 
generate a comparative* prediction of the essay score for all groups. Second, the 
EPT-Total score is predicted from the multiple of the College Board scores. These 
weighted composites appear to predict the EPT-Essay criterion equally well, with 
only»minor variations for some groups. The white sample essay performance is by 
far the most difficult to predict, and this finding is consistent with all the other 
correlational analyses. Ip the analyses ^f Table 6, It should be noted that SAT-M 

^3? See Appendix A, for further details. Note also that in a number of the 
tables in this report figures for ethnic groups do not add up to the total because 
identification is not available for all cases. 
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*TABLE tV-'-€ar^:ei.acions wich the EPT-Essay Score 



Correlations between the EPT-Essay Score and: 



EPT- EFT- 
EFT- Sentence Logic and 

Construction • Organization 



TSWE 



SAT-V SAT-M 



Group 




Score 


Score 


Score 


Secure 


Score 


^ Score 


Tot^l 


10,67)^ 


.46 


.49 


.46 


^ J_ 


_.^45 


.23 


Men . 


^4,766 


.43 


.48 


:>.45 


.48 


.44 


.29 


Women 


5,908 


.49 


.51 


.48 


.50 


.49 


.32 


Asian 


606 


.57 


.59 


.54 


.55 * 


.52 


. 16 


Black 


583" 


.53 


.56 


.56 


.56 


.53 


.30 


Hispanic 


445 


.50 


.55 


.49 


.53 


.50 


.28 




5,236 


.30 


.35 


.29 


.37 


.32 


.12 



a. Interpretation of these correlations should be made with an awareness that 
they are somewhat attenuated due to the low reliability of the essay criterion ^ 
measure. Coffman (19^6) has estimated that the score reliability of a single 
/^say scored independently by two readers (the case for these data) Is about .38. 
Corrections tor attenuation for reliabili^ty in the criterion would increase a 
correlation of .50, for example, to .81. 















TABLE 2. Correlations 


between the EFT- 


Reading 


Score and ATP 


^ Scores^ 










ATP^ 


Scores 






Group 2 




TSWE 




SAT-V ^ 


SAT-M 


Total 


10,674 


.^67 




-74 H 


.46 


Mjen 


4,766 


.64 




.72 


.46 


Women 


5,908 ' 


.69 




.75 X 


.50 


Asian 


"^606 
% 


.73 




.80 > 


'.35 


Black ^ 


5§3 


.66 




.76 


.51 ' 


Hispanic 


445 


.66 




.73 


' .46 


Whit^ 


5,2;36 ^ 


.55* 




.65 


.36 


• a. The Admissions 
SAT-V, SAT-M, and^TSWE 


Testing Program 


of the College Board, 
1 


represented here 


by 



TABLE 3. 


Correlations ' between 


the 


> " ^ a 
EPT-Sentence Construction Scare and ATP 


Scores 


* 






ATP^ Scores 


• 




Group 


N 




TSWE 


SAT-V ■ 


SAT-M - 

/ 


Total 


- — ■ • — — ■ 1 

10,674^^ 


< 


7 A 


70 ' ' " 


' .49 


Men 


4,766 . 




72 


.68 


/ .51 


Women 


. 3,908 




'.76 


.72 , * 


.54 


-A^-an — . 606 




..77 


.78 


.39 


Black 


583, 








A 

.54 


Hispanic 


445 




:7i 


.72 ~^ 




White 


5,236 




.66 ^ 


^60 


.39 


a . 


The Admissions Testing 


Program of the College Board 


, represented here by 


SAT-V, SAT-M, and TSWE. 










• 


f 










TABLE 4. 


f 

Correlations -ibetween 


the 


EPT-Logic and Organiz^ition Score and ATP 


Scores 








ATP^ Scores ^ v 






Group 


N 




TSWE 


SAT-V ' 


SAT-|l 


Total 


10,674 




' .67 V 


.71 


' .52 


Men 


4,766 




.64 


.70 . - 


^^.53 


Women 


5,908 




.69 


.73 


.54 


Asian 


y606 




.73 


.77 


.38 


Black * 


V 583 




.68 . 


.77 . , 


. .60 


Hispanic 


445 




.64 


.-71 


.51 


Whice 


5,236 




.55 


.61 

4 


.4.2 


^- 

a. 


The Admissions Testing Program of the College Board 


, represented here by" 



SAT-V, SAT-M, and TSInFE. 



TABLE 5. Cprrelatlong between EPT -Total Score and AJP^ Scores 



Group 



ATP^ Scores 



N 



TSWE 



SAT-V 



Total 

vWomen 
Asian 
Black 
Hispanic 
Wblte 



10,i74 
4,766 
5,9a8 

606 
583 
445 
5,236 



.76 
.74. 
.77 
.79 
.77 
.74 
.68 



.77 
.76 
.79 
.82 
.80 
.78 
.68 



.51. 
.54 
.56 
.37 
.57 
.52 
.40 



a. The Admissions Testing Program of the College Board, represented here by 
SAT-V, SAT-M, and TSWE. - 



TABLE 6. Predicting the^ 
ATP^ Scores 




-Essay Score and EPT-Total Score from Multiple EPT and 



Group* 



Multiple Correlation, 
EPj j i ipssay Score and: 



EPT 



ATP 



Multiple Correlation, ^ 
EPT-Total Score and ATP 



a. Reading, Sentence Construction, and Logic arid Organization. 

b. TSWE, SAT-V, and SAT-M. 



Total 


-t 

10,674 


.51 


.51 


.83 




Men 


4,776 . 


.50 


.50 


.82 




Women | 


5,408 


.53 


.53 


.84 




Asian 


606 


.61 


,58 


.86 




Black 


583 


.60 


.60 


.86 




Hispanic 


445 


.57 


.56 


.84 




White 


5,236 


.27 


.39 


.75 





d(|Kributed very little to the multiply correlation coefficient - usually only about • 
one additional hundredth. The final column of Table 6 shows the substantial rela- 
tionship between the EPT-Total score and the composite of the ATP components. 

Regression Analyses * , ^ 

The above correlational analyses demonstrate the degree of relatiojnship among the 
several tests, under the assumption that relationships are linear. An examination 
of the nature^of these linear relatiqnships , however, requires analyses of the re- 
gression systems related to the correlations. 

, Comparisons of regression systems for different groups were conducted through 
analyses of covariance. Under assumptions of linearity, regression lines are 
parallel (i.e., the slopes are equal) when no interaction i6 inferred between 
groups and predictors. That is, a unit increase in the predictor produces a pro- 
portionate increase in the criterion regardless of group membership. Second, if 
the lines are coincident (i.e., the slopes and intercepts are equal) the equation 
for one group is essentially the same as the equation for the-'other groups. The 
statistical procedures and other «ietails of the covariance analyses are given in 
Appendix / I 

The resi^lts^of the covariance analyses are givgn in Table 7. In every ethnic com- 
parison with whites on either TSWE or SAT-V, statistically significant interactions 
were found (p " .01).. However, the small changes in the multiple R's shown in 
Table 7, as group and interaction influences .were added, suggest that the interaction 
effects were small. When the nonessay components of the EPT were used to predict 
scores on the EPT-Essay, no significant interactions were found for blacks and whites, 
but tests of coincidence of the two regression lines showed significant differences 
(p < .01). This suggests that the white regression overpredicts for blacks through- 
out ^the range of EPT scores. A similar pattern is evident for prediction of Hispanic 
scores. Some interaction is presetit, but the white regression tends to overpredict 
Hispanic performance. The pattern for prediction of Asian scores is the least dis- 
tinct, although significant interactions do exist. Again, the multiple R's suggest 
little influence of group or interaction factors. 

The r^^lts for\he regression comparison^ of men and women are quite clear. No 
interaction exists for TSWE, SAT-V, or the EPT component predictors. Since the 
slopes are^parallel and alf y|e ^ests of coincidence are significant- (p <' .01), the 
^regression suggests that ^^HW higher at all score levels than men. Thus, the^ 
essay writing performance l^^inn would tend to be underpredicted (and that of men 
overpredic ted) by a linear e^il^)n based on data _for both sexes combined. 



Analyses by Score Levels ^ ' 

* 

While the^ regression comparisons just described wfere not indicative of important 
.group deferences, the statistically s^ignificant^ interactions and intercept differ- 
ences ate intriguing and worthy of further exploration. One approach toward ex- 
ploring the nature of any group differences is to conduct analyses within score 
levels. Such a procedure allows for the detection of nonlinear it ies in the data 
and also provides a sense of the practical importance of group differences. 

Table 8 presents group comparisons of EPT-Essay performance for four different 
score intervals of the JSWE. The percentages shown at the bottom of Table 8 are 
the observed proportions of group members within a given score* range who wrote 
better-than-average essays. Tests of significance were based on the percentages 
for the total frequencies at the left. Thus, for example, it would be expected that' 
70.3 percent of those in the sample scoring 50 or above on the TSWE would write 
bett,er-than-average essays. The tests of significance show that women scoring 50 
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Stable 7. 


CoVariance Analyses 


1 












Groupjs 
Compared 


Regressions 
Compared 


N 


R 

Using* 
Basic 
Predictor 


R 

Adding 
Group 


Add ing 
Inter- 
action 


F 

for Inte,r- 
a^tion (Test 
of Slope) 
t 


F 

for Coinci- 
dence^ (Test 
of Intercept) 



Blacks vs. whites 


Sssay 


on 


TSWE 


5,819 


.44 


.45 


.46 


*25 


4* f 


- 




Blacks vs. whites 


Essay 


on 


SAT-V ' 


5,819^ 


. 40 


, 41 


. 42 


42 


8* 






Blacks vs. whites 


Essay 


on 


EPT-R 


5^^*< 


.40 


, .40 


.40 


1 


8 


15.0* 




Blacks v$. whites 


Essay 


on 


EPT-SC 


5,819 


.44 


.44 


.44 


1 


3 


11.9* 




Blacks vs. whites 


Essay 


on 


EPT-L&O 


5,819 ' 


.40 


' .40 


.40 


1 


9 


8.4* 




HispanicS vs. whites/ 


Essay 


on 


TSWE 


, 

5,fe8l 


.41 


.41 


.42 


19 


6* 


- 




Hispanlcs vs. whites 


» Essay 


on 


SAt-V. 


5, 681 


. 36 


. 37 


. 38 


28 


6* 






Hispanlcs vs. whites 


Essay 


oa 


.EPT-R 


5,681 


.35 


.35- 


.36 


5 


6 


18.5* 




Hispanlcs^' va. vhites 


Efesay 


on 


EPT-SC 


• 5,681 


.40 


.40 


.41 


5 


1 


13.5* 




Hlspanics vs. whites 


Essay 


on 


EPT-L&O 


5,681 


.34 


.35 


.35 
* 


3 


8 


15.3* 




Asians vs. whites 


Esdray 


on 


TSWE 


5,842 


.4> 


.42 


.42 


18 


0* 


- 




Asians vs/ *Vhit*es 




on 


SAT-V 


5,842 


Ah 


.37 


* ^ .38 


25 


It 






Asians vs. whites 


E^say 


on 


EPT-R 


5,842 ' 


.37 


.37 


.37 


11 


5* 






Asians vs. ^i^hites 


Essay. 


on 


EPT-SC 


5,842 


.41 


.41 


.41 


4 


9 


7.5* 




Asians vs. whites 


Essay 


on 


EPT-L&O 


5,842 


.35 


.36 


.36 


7 


2* 






Blacks vs. Hispanics 


Essay 


on 


TSWE 


1,028 


.56 


.56 


.56 


<1 




<1 




Blacks vs. Hispanics 


Essay 


on 


SAT-V 


1,028 


.53 


.53 


.53 


<1 




1.0 




Blacks ^s. Hispanics 


Essay 


on 


B^T-R 


1,028 


.52 


.52 


.52 


<1 




1.2 ' 




Blacks vs. Hispanics 


Essay 


on 


EPT-SC 


1,028 


.56 


.56 


.56 


:t 




<l 




Blacks vs. Hispanics 


Essay 


on 


EPT-L&O 


1,028 


.53 


.53 


.53 






2.3 




H||^ vs. women 


Essay 


on 


TSWE 


10,674 


.'49 


.51 


.51 


^1 




]71.1* 




Men vs. women 


Essay 


on 


SAT-V 


10,674 


.45 


.49 


.49 


^1 




279.3* 




Men vs. women 


Essay 


on 


EPT-R 


10,674 


.46 


.49 


.49 


<1 




224.4* 




Men vs . women 


Essay 


on 


EPT-SC 


10,674 


.49 


' .52 




<1 




193.0* 




Men vs . women 


E^s^y 


on 


EPT-L&O 


10,674 


.46 


.49 


^ .49 


<1 




242.9* 





a. Where dashes are indicated, the standard practice was followed of not performing tests of coincidence 
when .the interactions are significant. 

*;£<.01 (Although significance ^occurs among the groups, note that little increase in R was found. This 
may be due to slight nonlinearity of the data ^ong the first three groups and, in the last group, to the 
very large N.) 
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TABLE 8. E!^say Writing Performance by Group^and TSWE Score Level 



TOI.Tt? 

Tbwt 

Score 




Total 




Men 


i 

Women ^ 


AlKan 


* 

Black 


Hispani<* 


White 








Frequencies Scoring 


1 n Four TSWF 














50+ 




' 2,855 




1,171 ^ 


1,684 


87 


33 


35 


1 


7 R 




40-49 




^ 3,914. 




1,724 


2 , 190 


206 


113 


l46 


9 


1 1. 1 




JO- 




2,659 




•l',300 


1,359 


176 
ff 


196 


1 


1 
J- 


i Jl J 




Below 


30 


1,246 




' 571 


675; 


137 


♦ ?4 1 






252 








Frequencies Writing 


Above-Average Essays 












5CH: 


^^-\^,006 




732* 


;,284 


51 


- - . 

23 ' 


26 


1 


\ 

zzy 




40-49 




2,087 




776 


1,311 


101 


\ 

40 


ou 


1 


, i JD 




30-39 




850 




341 


509 


60 


32 




408 




Below 


30 


167 




58 


109 


15 




1 4 




58 








Percencag 


2S Writing 


Above-Averag 


e Essays 














i 




3 


' 1f2.5* 


75.6^ 


58.6'^ 


69.7 


74.3 




70 


.7 


40-49 




53 


3 . 


45.0^ 


59.9^ 


49.0 


54.9 


54.8 




53 


.3 


30-39 




32 


0 


26.2* 


37.4^ 


34.1 


20.4* 


20.1* 




36 


6* 


Below 


30 


13 


.4 


' "''iO.2* 


16.1'^ 


10.9 


9.1* 


13.3 




23, 


0* 



*Statistically significant (p .05) deviation from expected percentage. 
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or above on TSWE wrote significantly more above-average essays (75.6 percent) than 
expected and that men and Asians who scored 50 or above on the TSWE wrote signifi- 
cantly fewer above-average essays than expected. The differences between men and 
women, moreover, occurred at all four TSWE score levels. Blacks and Hispanics 
scoring below AO on TSWE tended to wr^te fewer above-average essays than expected, 
though the 13.3 percent figure for Hispanics in the lowest (below 30) range was not 
statistically significant. ^ ' 

Conversely, whites scoring below AO oh the TSWE tended to write more above^ 
average essays than would be expected for people in those score ranges. The 
analyses represented in Table 8 are useful as explanations of the interaction and 
intercept differences reported in Table 7. The differences between men and women 
are consistent and are highlighted by Table 8, whereas the multiple R's ^£Jable^7 , 
trended to mask then^ / ^ ^ 

. X'fable 9 shows similar analyses for the 5AT-V as a predictor, t^e pattern is 
quiie sim?^lar to that observed for the TSWE in Table 8. Women consistently writre ' 
better essays than men in the same SAT-V score intervals, and whites who sx:ore below 
AOO on the SAT-V write better essays than blacks and Hispanics with SAT-V scores in 
the same intervals. * 



ERIC 



Tables 10, 11, and 12 present comparable analyses fo>-EPT-Reading, EPT-Sentence 
Coji^truct ion, and EPT-Logic and Organization as predictors of essay writing j^rform- 
ance. Like the TSU^ and SAT-V, the EPT components ' tend to underpredict essay per- 
* formance for women and to overo^^ict for men. And blacks and Hispanics scoring in 
the highest EPT intervals tend^to write fewer above-average essays than would be ex- 
pected from their multiple-choice test performance. 

Tables 8 through 12 may also b^ considered as analyses of false negatives and • 
faj.se positives. Those scoring low on a predictor but writing above-average essays 
are false negatives, and. those scoring high on a predictor but writing below-average 
essays are false [Positives, In Table 8, for example, the highest proportions of 
faJl^se Ti^gatives predicted by TSWE are for women and whites, and the lowest propor- 
tions for men, blacks, and Hispanics. In Table 9 rtie same pattern is shown with the 
SAT-V\as a predictor. Tables 10 through 12 also show this pattern. One should note, 
however, that what* is "low" for the EPT components is different from what is low for 
SAT-V ancf TSWE.* The '^averafge score for the EPT components is about Jl50, so that most 
of the Ugure&^ in Tables 10^ 11, and 12 are for relatively low EPi; scores. 

^ Another way of viewing the issue'of false negatives is to consider it m con- 
nection with the EPT-Total score. Even though the correlational compar isons pre- 
sented previously show substantial relationships between the EPT-Total sc^e^"d 
the other measures, such analyses do not examine performance by group withi%spe- 
cific score ranges.^ Since the previous analyses indicated few differences among the 
various tests, it is enoug-h to limit this l^st comparison to one of the tests. Table 

' 13 presents a comparison of TSWE and EPT-Total scores for all the groups,^ This table 
illustrates the strong relationship between' these two tests, which was demonstrated 
before by the high (.76) correlation reported in Table 5. For those students scoring 
50|or above on the TSWE, 96.0 percent were above average (150) on the EPT-Total score. 

^Jsjfi^x.^ are some variations across groups, with only 87.9 percent of the black sample 
With high TSWE scores scoring above average on the EPT-Total, but these differences 
are not statistically significant. At the other extreme of the TSWE distribution 
(below 30), only 6.9 percent of those students entering CSUC in 1977 obtained above- 
average EPT-Total scores. Blacks scoring below 30 on the TSWE seem least likely to 
perform well on the EPT, since only 2% 5 percent of blacks scoring below 30 on the 
TSWE had ahpve-average EPT-Total ' scores . These comparisons of EPT-Total^scores and 
TSWE scores lead to essentially the same conclusion as the previous analyses. The ^ 
smallest proportions of false negatives tend to occur for the black and Hispanic 
groups. The one noticeable difference between the pattern of Table 13 and that of 
other comparisons is that there wer^ no gtatistica;iy significant dif f erences'-betwecn 
men and women. 
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TABLE 9. Zss^y Writing Performance by Group and SAT-V Score Level 



SAT-V 
Score 



Total 



•Men 



Women>^ Asian 



Black 



Hispanic 



White 



500+ 

400-490 

300-390 

Below 300 
« 

500+ 

400-490 

300-390 

Below 390 

\ 

500+ ' 
400-490 
300-390 
Below 300 



Frequencies Scoring in Four SAT-V Ranges 
2^378 1,093 .1,285 .85 

4;246 2,000 ^,246 208 

3,076 1,330 ,1,746 196 

974 ^*343 631 117 



Frequencies Writing Above-Average Essays 



1,640 
2,257 
1,105 
108 



651 
876 
353 
27 



9S9 
'^1,381 

752 
81' 



53 
103 

66 
11 



18 
93 
238 
2 34 

10 
50 
70 

17 



Percentages Writing Above-Average Essays 

69.0 ' 59.6^ 77.0^ 67.3 55.6, 
53. 2 43:8^^ 61.5^ 49.5 53.8' 
35.9 26.5^ 43. l'^ 33.7 h^^-^ 

11.1 .7.9^ " 12.8 9.4 7.3* 



38 
126 
191 

90 

30 
62 
56 
7 



78.9 
49.2 

7.8 



^Statistically significant (p < .05) deviation 'from expec^d percentage. 



1,442 
2,312 
1,315 
147 



1,004 
1,256 
543 
34 



69.6 
53.8 
41.3* 
23»'l*' 



^ J 
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TABLE 10. Essay Writing Performance by Group and EPT-Reading Score 



EPT- 

Rieading 

Scor'e Total Men Women Asians Bl-acks Hispanics Whites 





Frequencies 


Scoring in 


Four EPT-Reading Score Rang 


es 








150+ 


6,522 




2,934 3,588 




272 


132 




165 




3,773 




140-14.9 


2,978 




1,352 1,626 




192 


203 




191 




1,262 




130-139 


' 716 




294 


422 




75 


120 
« 




52 




168 




Below 130 


'458 




186 


272 




67 


128 




37 




33 






Frequenc ies 


Writing Above-Average Essays 










\ 




150+ 


3,839 




1,449 2, 


390 




164 


64 




92 




2,267 




140-149 


1,116 




• 

407 


709 




60 


oU 




54 




521 




1 OA 1 OQ 


133 




47 ^ 






1? 


13 




7 




43 




Below 130 


22 




4 


18 




2 


5 




2, 




6 






Percentages 


Vriting Ab^ 


>ve-Average Essays 














150+ 


58 


9 


49.4* 


66 


6* 


^.0.3 


48 


5* 


55 . 


B 


^ 60. 


1 


140-149 


37 


5 


30.1* 


43\6*= 


31.2 


29 


6-> 


28. 


3 


i41 


3 


130-139 


18 


6 


46.0 • 


20 


.4 


. 22.7 


10 


8* 


13. 


5 


23 


6^ 


Below 130 


4 


8 


2.2 , 


6 


.6 


3.0 


3 






4 


18 


2-k 



*Stat isticaily significant (p, .05) deviation from expected percentage. 
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lliBLE 11. Ess^y Writing Performance by Group^ and EPi;-Sentence Construct ion Score 



EPT-Sentence 
Construction * 
Score Total 



150+ 
140-149 
130-139 ' 
Below 130 

150+ 
140-149 
130-139 
Below 130 

1^0+ 
140-149 
130-139 
BeTow 130 



Men 



Women 



Asians BXac^ks Hispanics Whites 



Frequencies Scoring in Four EPT-Reading Score Ranges 
7,050 3,092 3,958 307 166 200 

2,279 1,083 1,196 154 163 131 

900 ' 410 • 490 77 ' 142 78 

445 181 264 68 112 "36 

Frequencies Writing Above-Average Essays ' 

4,171 1,571 2,600 170 . 84 106 
758 279. 479 47 44 38 
155 49 166 9 14 9 
26 8 , 18 ^ 7 * 5 , 2 
Percentages Wrj.ting Above-Average Essays 



59.2 


50 


4^ 


^64.2-;^ 


55.3 


50.6^ 


53.0 


33.3 


25 


8* 


)r/A6.9* 


30.5 


27,0 


29,0 
















17.2 


12 


0^ 


. 22.8* 


U.7 


9.8* 


11.5 
















5.8 


4 


4 




10.3 


4,5 


5,6 



4,006 
979 
203 
48 



2,527 
330 
53 
5 



63.1* 
35.8 
26.1* 
10,4 



*Stat istically significant (p < ,05) deviation from expected per,centage. 



14 



J 



20 



TABLE 12. Essay Writing Performance by Group and EPT-Organizat ion and Logic Score 



150+ 

140-149 
130-139 
Below 130 



57.7 


48.4^ 


65.3^ 


49, 


7* 


51.0 


48 


2* 


37.5 


30.3* 


43.4^ 


3i 


1 


34.9 


' 33 


1 


16.8 


11.1* 


20.8* 


17 


7 


8.9* 


4 


9* 


6.6 


4.2 


8.3 


6 


2 


6.1 


13 


6 



*Statistically significant (p < .05) deviation from expected 

i 



EPT-Organization 
and Logic ^ - 
Score 


Total 


Men 


Women 


Asians 


Blacks 


Hispanics 


Whites 




Frequencies Scoring 


in Four 


EPT-Reading Score Ranges t 


- 


150+ 


7,114 


3,210 


3,904 


344 


143 


201 


4,024 


140-149 


2,237 


1,015 


1,222 


135 


152 


139 


yoi 


130-139 


781 


325 


456 


62 


123 


61 




Below 130 


542 


216 


326 


65 


165 


44 


AO 




Frequencies Writing Above-Average Essays 








150+ 


4,105 


1,554 


2,551 


171 


, 73 


97 


2,390 


140-149 


838 


308 


530 


42 


53 


46 


396 


130-139 


131 


36 


95 


\ n 


11 


3 


44 


Be'low 130 


36 


9 


27 


4 


10 


6 


7 




Percentages Writing 


Above-Average Essays 









59.4 
4K 
^.8 
14.3 




TABLE 13. EPT-Total Score Performance by Group and TSWE Score Level 
, 

TSWE 
Score 



Range 






Asian Black 


H i^D^n ir 


Whi te 
» 






Frequencies in Four TSWE Score 


R3nge& 








50+ 




2,861 1,175 'l,686 


1 * 

87 


33 


35 * 


1,743 


40-49 




3,917 1,725 2,\92 


i06 


113 


147 


2,131 


30-39 




2,666 ^ 1,304 1,362 


176 


196 


160 


1,118 


Below 


30 


1,275 587 688 


148 


243 


107 


25A " 






Frequencies with Above-Av^tage 


EPT-Tocal 


Scores 






50+ 




2,748 1,119 1,629 


85 


29 • 


32 


*1,683 


40-49 




3,000 1,290 1,710 


- 143 


58 


102 


1,695 


30-39 




972 449 5^3 


60 


33 


38 


517 


Below 


30 


88 38 50 
Percentages with Above-Average 


. J 5 

EPI-Xotal 


6 

Scores 


5 


38 


50+ 




96.0 95.2 96. 


6 _ 97. 


7 87.9 


91.4 


96.6 


40-49 




76.6 74.8 78. 


0, ^ 69. 


4^ 51.3^ 


69.4 


79.^ 


30-39 




36.4 34.4 38. 


4 34. 


1 * 16.8^- 


23.8* 


46.2*- 


Be 1 ow 


30 


6.^ ^.5' 7. 


3 * m. 


4 '2.5'- 


A. 7 


15.0* 




-Sxacist icallj^^4:^^nii leant (p ' .05) 


deviat ion 


f rem expected 


percentage. 




1 




• 

* 


f. 




« 

* 
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SUMMARY 



Seven cescs were compared and contrasted wich respecc Co incerrelacionships for six 
differenc groups. Special emphasis was placed on che use of an essay. The EPT-Essay 
score was used as a common cricerion for making comparisons of che EPT ncmessay coin- 
poaents with che College Board's SAT-V, SAT-M, and TSWE. The EPT-Tocal score was 
also used as a cricerion for some comparisons. Correlation , regression, and score- . 
interval analyses 'were conducted. 

The correlational analyses demonstrated the close relationship among SAT-V, 
TSWE, and EPT scores. There were f ew'correlat ional differences across groups, with 
the exception <hat the white sample consistently yielded lower correlations for all 
comparisons than did .the other groups. Whether the relationship being examined 
was wiDh che EPT-Ess^y score, the EPT-Reading score, the EPT-Sentence Construe- ' 
tion score, the EPT-Logic and Organization score, or with the EPT-Total -score, 
the correlation for che whice sample was always che lowesc . The lower whace corre- 
lacions were accribuced co accenuaced variances in che, whice s:ciraple for mdsc scores. 
A second incerescing observacion was chac of che relacionship becween che success of 
mulciple corapQnencs* of che EPT and College Board's cescs as pre^^iccors with che 
EPT-Essay score as a common ,cricer ion. Boch sees of componencs predicced EPT-Essay 
performance" equally well for alF groups* Finally, che EPT-ToCal score was seen co 
'bet-^'well predicted by che mulciple set, at College 'Board scores (SAT-V, SAT-M, and 
TSWE), even chough che SAT-M concribuced onlv minimally co chis prediccion.^ 

The regression analyses indicaced che nacure of relac ionships among che various 
cesc scores. The EPT-Essay score was used as a common cricerion, and all groups were 
compaced and concrasced in regard to che naCure of che prediccive relacionship for 
'each of che nonessay componencs scudied. Boch che TSWE and che SAT-V were seen co 
overpredicc m^noricy performance in essay wr icing. Similarly, ^che EPT nonessay 
componencs cended Co overpredicc minoricy performance in essay wricjng. The obser- 
vacion of overpredic cion of minoricy performance has been a common one in previous 
s-Cudies of all types of cescs. The regression comparisons for men and women also 
followed a patcern ofcen observed in previous"^ scudies. ThroughouC che score ranges 
for TSWE, SAT-V, and all chree EPT nonessay components, Lomen were consiscencly 
underpredicced. In ocher jwords, women consiscencly per/irmed beccer on essay wricing 

casks Chan che cesc scores in chis analysis indicace. 
y 

The*«core incerval analyses fu^cher illuscraced che prediccive^ relacionships. 
The E^T-Tocal score appeared co be quice closely reiaced co TSWE scores, as 96 per- 
cenc of^ chose scoring 50 or above on TSWE obcained above-average EPT-ToCal scores, 
buc only 7 percenc of chose scoring below 30 on TSWE obcain^^d 'above-average EPT-ToCal 
scores. 'These descr ipcive. comparisons were similar for all groups. Discribuc ions 
were also used co decermine the proporcions of false negacives for differenc groups 
in differenc TSWE score ranges, using boch che EPT-Essay and the EPT-ToCal score as 
criceria. These analyses showed chac blacks and Hispanics tended Co have che smallesc 
repres^ncac ions of false negacives, while women had che mosc. The resulcs 'obcainecj 
were in agreemenc wich similar, previous scudies. One incerescing new finding was 
observed, however. Alchough women consiscencly ouC-performed men on essay wricing, 
Chey did noc do so when Che cricerion consisced primarily of mul ciple-choice measures, 
•as in che EPT-Xocal score. 
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TABLE A-1. Intercorrelations of EPT and ATP Components for total Sample (N=lv 



Test 



Mean 



Standard 
Deviairion 



\ 

6 7 



1. 


EPT-Essay 
* 


7. 


31 


1. 


85 


1.00 . 


.46 


.49^>A 


46 


-71 


.45 


. 23 


.49 


2. 


EPT-Reading 


150, 


17 


9. 


14 


.46 


1.00 


.76 


79 


.89 


.74 


. 46 


.0/ 


3. 


EPT-Sent;^nce 
Construction 


150 


40 


9 


32 


..49 


■ .76 


1.00 


76 


.89 


.70 


*.49 


.74 


4. 


EPT-I,ogic 


150 


10 


, 9 


36 


^ .46 


.79 , 


,76 1 


.00 / 


.89 


.72 


• .52 


.67 


5. 


EPT-Total 


150 


18 


7 


84 


* 

.71 


.89 


.89 


.89 


1.00 


.11 


.51 


.76 


6. 


SA*r-V^ 


42 


42 


9 


.50 


.45 


.74 


.70 


.72 


.77 


1.00* 


.54 


.72 


7. 


SAT-M^ 


47 


40 


10 


64 


' .23 


.46 


.49 


.52 


,5l' 


.54 


1.00 


.49 


8. 


TSWE 


42 


.44 


9 


.95 ' 


,49 


.67 


.74 


.67 


.76 


.72 


.49 


1.00 



a. Note that the scale used for these purposes has been truncated to 20 to 80, rather than 
the usual 200 to 800 scale for reported* scores. * 



^0 



TABLE A-2, Intercorrelations of EPT ^nd ATP Components for Men (N=4^766) 



.Test 



Mean 



Standard 
Deviation 1 



1. EPT-Essay 


6.95 


1 


.83 


1.00 


.43 


.48 


.45 


.71 


.44 


.29 


' .48 


2. EPT-Reading 


150. '22 


8 


.85 


..43* 


1/00 


.74 * 


.78 


.88 


.72 


•^6 


.64 


























3. EPX-Sentence 
























" Construction 


IjU • 1 J 


Q 

y 


. Do 




. /4 1 


.00 


.74 


Q Q 
. 00 


£. Q 
. DO 






4. EPT-Logic 


150:33 


9 


.10 


.45 


.78 


.74 


1.00 


.88 


.70 


.53 


.64 


5\v^T-Totai 


149.'73 


7 


.56 


.71 


.88 


.88 


.88 


1.00' 


.76 


.54 


.74 


6. SAT-V^ 


42.96 


9 


.21 


.44 


.72 


.68 


.70 


.76 


1.00 


'.53 


.71 


7. SAT-M^ 


50.83 


10 


.6a 


.29 


.46 


.51 ^ 


.53 


• .54 


.5*3 


•1.00 


.51 


8. TSWE 


41.94 


9 


.82 


.48 


.64 


»1 


.64 


. .74 


.71 


.51 


1.00 


a. Note that 


the scale 


used 


:or these 


purposes 


has been 


truncated to 


20 to 80, 


rather 


than the 





usual 200 to 800 scale for reported scores^. 
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TABLE A-3. Intercorrelations of EPT and ATP ^Components for Women (N=5,908) 



Test 


Mean 


Standard 
Deviation 




2 


3 


4 


5 




6 


7 


'8 


1. EPT-Essay^ 


7.60 


1.82 


l.QO 


.49 


,51 


.49 




72 


.49 


.32 


.50 


2. EPT-Reading 


150.12 ' 


9.37 




1.00 


,78 


,81 




.90 


.75 


.50 


.69 


3. EPT-Sentence 
Construction 


150.63 


9.50 


.51 


.78 
» 


1.00 


.78 




.90 


.72 


.54 


.76 


4. EPT-Logic 


149.92 


' 9.56 


.49 


.81 


.78 


1.00 


/ 

/ 


.90 


.73 


.54 


.69 ^ 


5. EP?-Total 


150.54 


8.03 


.72 


.90 


.90 


.90 


f 

\ 1 


.00 


.79 


.56 


.77 


6. SAT-V^ 


41.98 


97.01 


.49 


.75 


.72 


.73 




.79 


1.00 


.57 


.74 


7. SAT-M^ 


*44.63 


97.56 


.32 " 


.50 


.54 




.56- 


.57 


1.00 


.54 


8. TSW^ 


42.85 


10.05 


.50 


.69 


.76 


.69 




.77 


.74 


.54 


1,00 



a. Note that the scale used for these purposes has been truncated to 20 to 80, rather than the 
usual 200 to 800 scale for reported scores* 



TABLE Intercorrelat ions of EPT and ATP Components for Asian Sample (N=606) 



Standard 

,Test Mean Deviation 1 ,2 3 4 5 6 7 '8 



1. EPT-Essay 


6 


82 


1 


93 


1.00 


.57 


.59 


.54 


;75 


.52 


.16 


.5S 


2. EPT-Reading 


145 


82 


11 


14 ^ 


.57 


1.00 


.82- 


.85 


.92 


.81 


.35 


.73 


3. EPT-Sentence^ 
Construction 


146 


23 


11 


39 


.59 


.82 


1.00 


.83 


.93 


« .78 


.39 


.77 


4. EPT-Logic 


147 


08 


11 


21 


.54 


.85 


.83 


1.00 


.92 


.77 


.38 


.73 


5. EPT-Totai 


146 


68 


9 


61 


.75 


.92 


.93 


.92 


1.00 


.82 


.37 


.79 


6. SAT-V^ 


38 


90 


9 


89 


.52 


.81 


.78 


.77 


;82 


1.00 


.46 ' 


.76 


7. SAT-M^ 


49 


27 


9 


81 


; .16 


.35 


.39 


.38 


.37 


.46 


1.00 V 


.41 


8. Tsm 


38 


16 


10 


47 


.55 


.73 


.77 


.73 


.79 


.76 


.41 


1.00 * 



a. Note that the scale used for these purposes has been truncated to 20 to 80, rather than the 
usual 200 to 800 scale for reported scores. 




TABLE A-5. Intercorrelap ions of EPT and ATI^ Components for Black Sample (N=583) 

9 



Standard 

Test Mean' Deviacion 1 2 3 ^ 



1 . EPT-Essay 


6, 


20 


1, 


86tfVk 


> 

1.00 .53 




57 


.56 




.53 


.30 


.56 


2. EPT-Reading 


139.91 


11. 


58 


.53 1.00 




76 


.80 


.90,^^ 


.76 


.51 


.66 


3. EPT- Sentence 
























.75' 


« Construccion 


UO 


68 


. 11. 


25 


-.57 .7^ 


1 


00 


.75 


.89 


.71 




4. EPT-logic 


138 


31 


12 


24 


.56 ^ .80 




75 


1.00 


.91 


,11 


.60 


.68 


5. EPT-Tocal 


140 


84 


9 


65 ^ 


".74 ' .90 




89 


.91 


1.00 


,80 


.57 


.77 


6. SAT-V^ 


• 32, 


40 


8 


25 


.53 .76 




.71 


.77 


.80 


1.00 


.61 


^.69 


7 . SAT-M^ * 


34 


84 


7 


91 " 


.30 .51 




.54 


' .60 


.57 


.61 


1.00 


.55 


8. TSWE 


32 


79 


9 


46 


.56 .66 




.75 


.68 . 


,11 


.69 ' 


.55 


1.00 






J 




















a. Note thac 


Che scale 


used 


for chese 


purposes has 


been 


« 

truncated co 


20 CO 80, 


racljer 


than 


the 



usual 200 to 800 scale for reported scores. > 
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TXBLE A-6. Intercorrelations .of EPT and ATP Components for Hispanic Sample (N=445) 



Standard 

Test *flean ' Deviation 1 2 3 4 ' 5 6 7. 



1. 


EPT-E;ssay 


6 


60 


1.94 


1.00 


. 49 




55 


. 49 




74 


. 50 


0 o 
. LO 




2. 


EPT-Reading 


145 


61 


9.93 


^.49 


1.00 




75 " 


*.77 




88 


.73 
1 


.47 


^.66 


3. 


EPT-Sentence 
Construction 


145 


53 


10.30 


.55 


.74 


1 


00 


.76 




90 


.72 


.53 


.71 


4. 


EPT-Logic 


145 


.34 


10.49 


.49 


.77 




76 


1.00 




89 


.71 


.51 


" .64 


5. 


EPT-Total 


145 


74 


8.62 


.74 


.88 




90 


.89 


1 


00 


. .76 


.52 


.75 


6. 


SAT-V^ 


37 


02 


8.45 


.50 


.73 




72 


.71 




.78 


1.00 


^ .50 


.70 


7. 


SAT-M^" 


40 


13 


8.90 


. .28 


.47 




53 


.51 




52 


.50 


1.00 


.45 


8. 


TSWE 


36 


.74 


9.40 


.53 


.66 




71 


.64 




75 


.70 


.45 


1.00 




a. Note that 


the scale 


used for 


these purposes 


has 


been 


truncated to 


20 


to 


80, rather 


than 


the 



usual 200 to 800 scale for re{>orted scores. 
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TABLE A-7. Intercorrelat ions of EPt and ATP Component s*f or l^ite Sample (N=5,236) 



Standard 



Test , 


Mean 


Deyiation 


1 ' 


2 


3 


• 


4 




5 




6 


7 


8 


1. EPT-Essay 


7.64 


1.71 


1.00 


.30 




35 


> 


.29 




.67 


.32 


.12 


.37 


2. EPT-Reading 


152.74 


6.6^ 


.30 


1.00 




63 




.66 




.81 


.65 


.36 


.55 


3. EPT-Sentence 
Construction 


152.94 


7.06 


.35 




1 


00 




.64 




.83 


.60 * 


.39 


.66 


4. EPT-Logic 


152.69 


6.58 


.29 


.66 




64 


1 


.00 




.81 


.61 


.42 


.55 


5. EPT-Iotal 


152.51 


5.55 


.67 


.81 




83 




.81 


1 


.00 


.69 
1.00 


.40 


.68 


6. SAT-V^ 


44.72 


8.50 


.32 


.65 




60 




.61 




.69 


.45 


.64 


7. SAT-M^ 


49.32 


9.82 


.12 


.36 




39 




.42 




.40 


.45 


1.00 


.39 


8. TSWE 


45.01 


8.69 


.37 


.55 




66 




.55 




.68 


.64 


.39 


1.00 


a. Note that 


the scale 


used for these 


purposes has 


beeii 


truncated to 


20 


to 


80, rather 


than 


the 



usual 200 to 800 scale for reported scores. ' 
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APPENDIX B. REGRESSION ANALYSES: PROCEDURES AND TABLES 



Previous researeh- comparing regression systems in group comparisons often has not 
specified precisely how the comparisons were made. This note is intended to 
rectify that situation — at least for this report. 

If the regression of a criterion on a test score is bhe^ame for different 
groupfe, the test is an unbiased predictor of the criterion\j>r those groUps. In 
the present study, to determine the validity of predicting tS&"<fe?T-Essay score 
from TSWE, SAT-V, and EPT component scores for various ethnic gro^s and for men 
and women, multiple regression analysis was utilized. 'The procedure is similar to 
the rationale of Gulliksen and Wilks. (1950), which is based on evaluation of the 
quality of errors of estimate, of slopes, and of intercepts. This general 
methodology has been frequently used in test bias studies (e.g., Cleary, 1968; 
Humphreys, 1973; Reschley and Sabers, 1979; Temp, 1971), 

j» , 

The^neral linear model technique of the software provided by t*he Statistical 
Package fox the Social Sciences (Nie, Hull^, Jenkins, Steinbrenner , and Bent, 1975, 
pp. 381-3w) was used because of its convenience and availability" to most re- 
searchers. f Since multiple regression may include dummy variables as predictors, 
pairs of ethnic groups to be compared were used as dichotomous, categorical vari- 
ables. A least squares approach to analysis of covariance similar to the method 
employed by Cleary (1968) permitted testing of effects of the interaction of test 
score by ethnicity and of the effect of ethnicity, holding test score constant. 

The single multiple regression for the two group categories (Z = 0,1), score 
on TSWE (X), the EPT-Essay criterion (Y*), and the interaction between ?ace and 
TSWE (ZX) was developed from the rationale of Kleinbaum and Kupper (1978, Chapter 
13) as follows: 

Y* = Bq + B^X + B^Z + B^(ZX) +>ErrQr, 



For blacks (Z 4 1) the model reduces to: 



Yg* = (Bq + B^) + (B^ + B^)'X + Error; 



and for whites (Z = 0) : 

^r,' = + + Error, 
w U 1 

Thus, one can test the slopes and intercepts of two groups .within a single jmodel. 



Humphreys (1973) has stated that the critical comparisons^ to be made are tests 
of equal slopes and Intercepts. Following his suggestion ther two hypotheses tested 
were from the sift^la^regression ecjuation. 




1. The two regression lines are parallel; that is, H^: B^ = 0, i.e., X is 
proportional from group to group. When B- = 0, tne slope for blacks be- 
comes the slope for whites, 

f 

2. The two regression lines are coincident; i.e., Hq: B2 = B3 ^ 0, i.e., X is 
identical after the differences in groups are removed. When B2 = B3 = 0, 

^ the model for blacks reduces to the model for whites. 

To test hypothesis 1 the following F statistic was computed: 

SS , (X,Z, X2) - ss , (X,Z) 

p(xz|x,z, . 'gs° , .x,z, ^ """" 

residual 
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To test hypothesis 2: 

' [SS , X,Z, XZ) - SS .^^ (X)]/2df 

^ regression regression , 

F(XZ,Z1X) - MS , (X,Z, XZ) 

residual 

If the slopes were significantly different, no test of coincidence of lines was 
conducted. ' 

The multiple for (X, Z, XZ) and its various subsets may also.be used, as it 
represents the proportion of total sums of squares (Overall and Spiegel, 1969). 
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TABLE B-l. Estimates of Simple Regression Parameters - Ethnic Comparisons 



Raw 







Intercept 


Coefficient 


Standardized 




Groups 


EPT-Essay on: 




• 


Coefficient 


S.E. of B^ 


Asian 


TSWE 


2.96 


. 1011 


.547 


.0063 




SAT-V 


2.88 


.0101 


.517 


.0007 




EPT-R 


-7.72 


.0997 


.574 


.0058 




EPT-SC 


-7.82 


.1001 


.589 


.0056 




EPT-L&O 


-6.97 


.0937 


.543 


.0059 


Black 


TSl'/E 


2.59 


.1100 


.560 


.0068 




, SAT-V 


2.30 


.0120 


..535 . ^ 


" .0008 




- EPT-R 


-5.77 


.0856 


.533 


.0056 




EPT-SC 


-6.92 


.0933 


.565 


.0056 




EPT-UO 


-5.47 


.0844 


.556 


>^ .0052 


Hispanic 


TSWE 


2.56 


.1101 


.533 


].0083 




SAT-V 


2.32 


.0115 


.503 


1.0009 




EPT-R 


-7.47 


.0966 


.495 


.0081 




EFT-SC 


-8.49 


. 1037 


.551 


.0075 




EPT-L6.0 


-6.67 


.0913 


.494 


.0075 






4.36 


.0728 


, 370, 


y0025 




SAT-V 


4.74 


.0065 


. 322 ^ 


.0003 




EPT-R 


-4.05 


.0765 


.297 


.0034 




E^T-SC 


-5.47 


.0857 


.354 


.0031 




/PT-LSO , 


-3.87 


.0754 


.290 


.0034 



.-i 



TABLB B-2. Estimates of Simple Regression Parameters - Sex Comparisons 



Groups 




Females 



riaies 



EPT-assay on: 



Intercept 



Raw 

Coefficient 
(Bp 



Standardized 
Coef f icient 



S.E. of B 



TSWE 




3 


72 


.0907 


.500 




.0020 


SAT-V 




3 


76 


.0092 


.48i7 




.0002 


EPT-a 




-6 


70 


.0953 


.490 




.0022 


EPT-SC 


-7 


03 


.0972 


.506 




.0022 


EPT-L6K) 


-6 


28 


.0926 


.486 




.0022 


TSWE 




3 


20 


.0894 


.480 




.0024 


SAT-V 




3 


16 


.0088 


.444 




.0003 


EPT-R 




' -6 


50 


.0895 


.•^4 




.0027 


EPT-SC 


-7 


.69 


.0975 


.485 




.0026 


EPT-L60 


-6 


53 


.089 7 


.446 




.0026 
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